Претрага
68 items
-
Development of Open Educational Resources (OER) for Natural Language Processing
In this paper we present the development of an online course at the edX BAEKTEL platform named “Lexical Recognition in the Natural Language Processing (NLP)”. It is based on the course of the same name for PhD studies at the University of Belgrade, Faculty of Philology. There are not many courses in Computational Linguistics (CL) on OER platforms, and there is none in Serbian either for CL or NLP. We have developed this course in order to improve this ...... 2. COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING Computational linguistics (CL) is a theoretical discipline between linguistics and computer science concerned with understanding and modelling the written and spoken language from a computational aspect.[3]Natural ...
... There are not many courses in Computational Linguistics (CL) on OER platforms, and there is none in Serbian either for CL or NLP. We have developed this course in order to improve this situation as it can prove useful both for linguists working in corpus linguistics and computer scientists developing ...
... knowledge to use the existing resources for NLP for Serbian and to develop new ones. Keywords: E-Learning, Open Educational Resources, Computational Linguistics, Lexical Resources, edX 1. INTRODUCTION Open educational resources (OER) publicly available on the web are growing quickly and are ...Cvetana Krstev, Biljana Lazić, Ranka Stanković, Giovanni Schiuma, Miladin Kotorčević. "Development of Open Educational Resources (OER) for Natural Language Processing" in The Sixth International Conference on e-Learning (eLearning-2015), September 2015, Belgrade, Serbia, Belgrade : Belgrade Metropolitan Univesity (2015)
-
Веб-алат за управљање грађом Речника САНУ и анотација листића
Грађа на основу које се израђује Речник српскохрватског књижевног и народног језика САНУ, а која садржи материјал из преко 4.500 писаних извора и 300 рукописних збирки речи са подручја народних говора штокавског наречја, забележена је на око 5.000.000 листића. Богат лексички материјал, који обухвата књижевни и народни језик у протекла два века и на основу кога треба да се напише још најмање 15 томова Речника, пружа могућност и за разноврсна лингвистичка и ванлингвистичка истраживања. Из тог разлога се приступило ...Рада Стијовић, Ранка Станковић, Михаило Шкорић. "Веб-алат за управљање грађом Речника САНУ и анотација листића" in Rasprave Instituta za hrvatski jezik i jezikoslovlje, Institute of Croatian Language and Linguistics (2020). https://doi.org/10.31724/rihjj.46.2.32
-
EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School
Prva škola za obuku polaznika koju je organizovala COST akcija NexusLinguarum održana je od 8. do 12. februara 2021. godine sa ciljem da studenti, istraživači i stručnjaci nauče osnove lingvističke nauke o podacima. Tokom obuke polaznici su se upoznali sa širokim spektrom tema: od semantičkog veba, RDF -a i ontologija, do modeliranja i pretraživanja jezičkih podataka pomoću najsavremenijih ontoloških modela i alata. Škola je održana u okviru serije letnjih škola EUROLAN-a i organizovalo ju je virtuelno (onlajn) nekoliko instituta; ...nauka o lingvističkim podacima, povezani podaci u lingvistici, jezički podaci, EUROLAN, NexusLinguarum, COST akcija, škola za obuku... and covers topics that are particularly relevant to the fields of computational linguistics and natural language processing (NLP). The goal of this 15th EUROLAN School was to bring together scholars, teachers and students of linguistics, NLP and information technology to discuss the principles and best ...
... Introduction to Linked Data for Linguistics Online Training School Milan Dojchinovski, Julia Bosque Gil, Jorge Gracia, Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School | ...
... employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Professional paper EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School UDC 81: 37.018.53 DOI 10.18485/infotheca.2021.21.1.7 ABSTRACT: The first training school or- ganized by the NexusLinguarum ...Milan Dojchinovski, Julia Bosque Gil, Jorge Gracia, Ranka Stanković. "EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.7
-
Frequency and Length of Syllables in Serbian
Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová (2019)Basic analyses of several properties of syllables (the rank-frequency distribution, the distribution of length, and the relation between length and frequency) in Serbian is presented. The syllabification algorithm used combines the maximum onset principle and the sonority hierarchy. Results indicate that syllables behave similarly to words as far as mathematical models are concerned, but values of parameters in models for syllables are quite different from those for words.... Issues in Quantitative Linguistics 2: 136-147. Lüdenscheid: RAM-Verlag. Frequency and Length of Syllables in Serbian 123 Mačutek, J., Wimmer, G. (2013). Evaluating goodness-of-fit of discrete distribution models in quantitative linguistics. Journal of Quantitative Linguistics 20, 227-240. Meštrović ...
... Press. Crystal, D. (2008). A Dictionary of Linguistics and Phonetics. Oxford: Blackwell. Eddington, D., Treiman, R., Elzinga, D. (2013a). Syllabification of American English: Evidence from a large-scale experiment. Part I. Journal of Quantitative Linguistics 20, 45-67. Eddington, D., Treiman, R. ...
... bivariate model. Journal of Quantitative Linguistics 20, 241-251. Koščová, M., Mačutek, J., Kelih, E. (2016). A data-based classification of Slavic languages: Indices of qualitative variation applied to grapheme frequencies. Journal of Quantitative Linguistics 23, 177-190. Köhler, R. (2005). Properties ...Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová. "Frequency and Length of Syllables in Serbian" in Glottometrics (2019)
-
A Method for Extracting Translational Equivalents from Aligned Texts
Obradović Ivan (2013)Obradović Ivan. "A Method for Extracting Translational Equivalents from Aligned Texts" in Methods and Applications of Quantitative Linguistics, Selected papers of the 8th International Conference on Quantitative Linguistics (QUALICO) in Belgrade, Serbia, April 26-29, 2012, Ivan Obradović, Emmerich Kelih, Reinhard Köhler (eds.), :University of Belgrade & Academic Mind (2013): 119-129
-
Quantitative analysis of syllable properties in Croatian, Serbian, Russian, and Ukrainian
Biljana Rujević, Marija Kaplar, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Jan Mačutek (2021)Biljana Rujević, Marija Kaplar, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Jan Mačutek. "Quantitative analysis of syllable properties in Croatian, Serbian, Russian, and Ukrainian" in Language and Text: Data, models, information and applications, John Benjamins Publishing Company (2021). https://doi.org/10.1075/cilt.356.04ruj
-
E-Dictionaries and Finite-State Automata for the Recognition of Named Entities
Krstev Cvetana, Vitas Duško, Obradović Ivan, Utvić Miloš. "E-Dictionaries and Finite-State Automata for the Recognition of Named Entities" in Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing, FSMNLP 2011, July 2010, Blois, France, A. Maletti and M. Constant (eds.), :Association for Computational Linguistics (2011): 48-56
-
Corpus-based bilingual terminology extraction in the power engineering domain
Ovaj rad predstavlja resurse i alate koji se koriste za ekstrkciju i evaluaciju dvojezične, englesko-srpske terminologije u domenu energetike. Resursi se sastoje od postojeće opšte i domenske leksike i domenskog paralelnog korpusa; alati uključuju ekstraktore termina za oba jezika i alat za poravnavanje segmenata koji pripadaju korpusnim rečenicama. Sistem je testiran variranjem funkcije podudaranja koja utvrđuje prisustvo ekstrahovanog termina u poravnatom segmentu (odsečak), u rasponu od veoma labavog do strogog. Procena rezultata je pokazala da je preciznost izdvajanja termina ...Tanja Ivanović, Ranka Stanković, Branislava Šandrih Todorović, Cvetana Krstev. "Corpus-based bilingual terminology extraction in the power engineering domain" in Terminology, John Benjamins Publishing Company (2022). https://doi.org/10.1075/term.20038.iva
-
Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++
Branislava Šandrih, Ranka Stanković (2020)U nauci, industriji i mnogim istraživačkim oblastima, terminologija se brzo razvija. Najčešće, jezik koji je „lingua franca“ za većinu ovih oblasti je engleski. Kao posledica toga, za mnoga polja termini domena su koncipirani na engleskom, a kasnije se prevode na druge jezike. U ovom radu predstavljamo pristup za automatsko izdvajanje dvojezične terminologije za englesko-srpski jezički par koji se oslanja na usaglašeni dvojezični korpus domena, ekstraktor terminologije za ciljni jezik i alat za usklađivanje delova. Ispitujemo performanse metode na domenu ...... Processing and Computational Linguistics (CICLING2016), LNCS. Springer, 2016 Hazem, Amir and Emmanuel Morin. “Efficient Data Selection for Bilin- gual Terminology Extraction from Comparable Corpora”. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical ...
... Expressions from Small Parallel Corpora”. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters, COLING ’10, 1256– 1264. Stroudsburg, PA, USA: Association for Computational Linguistics, 2010. http://dl.acm.org/citation.cfm?id=1944566.1944710 Vintar, Špela and Darja Fǐser ...
... . In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, Vol. 1, 48–54. Association for Computa- tional Linguistics, 2003. Infotheca Vol. 19, No. 2, December 2019 135 Šandrih B., Stanković R., “Extraction ...Branislava Šandrih, Ranka Stanković. "Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.6
-
Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis
U ovom radu predstavljen je model koji omogućava prikupljanje, pripremu, opis metapodataka, upravljanje i eksploataciju, uključujući pretragu punog teksta dokumenata iz domena kriminalistike napisanih na srpskom jeziku. Predloženi pristup primenjuje se na veb portalu koji sakuplja različite tekstove nastale iz časopisa Akademije za kriminalistiku i policijske studije, Krivičnog zakona Srbije, konferencija „Tara“ i „Reiss“, kao i iz nekih doktorskih disertacija vezanih za ovu oblast istraživanje. Nakon obrade teksta, korpus koji sadrži preko 5500 stranica običnog teksta, kreiran je i ...... on which administrator regulates the Digital Library 3 https://www.gnu.org/licenses/gpl-3.0.en.html http://master-kpa.rgf.rs/ FORENSIC LINGUISTICS The linguistic study of forensic texts is a part of the field of Natural Language Processing, which includes text types classification and ...
... element of revenge in their last moments. Death row statements are within the heavily institutionalized setting of death row prisons. The Forensic Linguistics Institute holds a corpus of these documents and is conducting research on them. And the last but not the least important is social media. Social ...
... enhances the potentials of manipulating each particular resource as well as several resources simultaneously10. 4 John Olsson (2008).. Forensic Linguistics, Second Edition. London: Continuum ISBN 978-0-8264-6109-4 5 Falzini, Mark W. (9 September 2008). "The Ransom Notes: An Analysis of Their Content ...Dalibor Vorkapić, Aleksandra Tomašević, Miljana Mladenović, Ranka Stanković, Nikola Vulović. "Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis" in International Scientific Conference “Archibald Reiss Days” Thematic Conference Proceedings Of International Significance, Belgrade, 7-9 November 2017, Academy Of Criminalistic And Police Studies Belgrade (2017)
-
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others (2020)Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages ...... In Pro- ceedings of the 21st International Conference on Compu- tational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, pages 105– 112. Association for Computational Linguistics. Niemann, E. and Gurevych, I. (2011). The people’s web meets linguistic knowledge: ...
... disambiguation rivaling supervised systems. In Proceedings of the 48th annual meeting of the asso- ciation for computational linguistics, pages 1522–1531. Association for Computational Linguistics. Pustejovsky, J. (1995). The Generative Lexicon. MIT Press, Cambridge, MA. RAE, R. A. E. (2001). Diccionario ...
... Association for Computational Linguistics, 1:151–164. Matuschek, M. and Gurevych, I. (2014). High perfor- mance word sense alignment by joint modeling of sense distance and gloss similarity. In Proceedings of COL- ING 2014, the 25th International Conference on Compu- tational Linguistics: Technical ...Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others . "A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment" in Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), Marseille, European Language Resources Association (ELRA) (2020)
-
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...... language in logic and the 4th conference on Computational natural language learning -Volume 7, Stroudsburg: Association for Computational Linguistics, pp. 67--72. Smadja, F.(1993). Retrieving Collocations from Text: Xtract, Computational Linguistics, 19(1), pp. 143-177. Stanković, R., Obradović ...
... Springer Berlin Heidelberg, pp. 1160--1165. Church, K. W., Hanks, P., (1990). Word association norms, mutual information, and lexicography, Computational Linguistics, 16, pp. 22--29. Church, K. W. Gale, W., Hanks, P., Hindle, D. (1991). Using statistics in lexical analysis, In U. Zernik (Ed.), Lexical ...
... Engine: ten years on. Lexicography 1(1), pp.7--36. Dunning, T. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1), pp.61--74. Frantzi, K., Ananiadou, S., and Mima, H. (2000). Automatic recognition of multi-word terms:.the C-value/NC-value ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
Ontološki model upravljanja rizikom u rudarstvu
Olivera Kitanović (2021)Rudarska proizvodnja obuhvata kompleksne tehnološke sisteme, što nameće potrebu za uspostavljanjem i unapređivanjem sistema upravljanja rizikom. Heterogenost i obim podataka neophodnih za upravljanje rizikom zahtevaju sistem koji ih na fleksibilan način integriše i omogućava njihovo optimalno korišćenje. Osnovni cilj ove disertacije je razvoj ontologije za domen rudarstva i na njoj zasnovanog modela za upravljanje rizikom. Njegova realizacija podrazumeva i implementaciju algoritama ekstrakcije informacija za popunjavanje ontologije, kao i odgovarajuće softversko rešenje. Razvoj modela obuhvata i značajno proširenje rudarskog korpusa, kao ...rudarstvo, rizik, upravljanje rizikom, procena rizika, ontologija, semantička mreža, ekstrakcija informacija, upravljanje znanjem, računarska lingvistika... model includes a significant expansion of the mining corpus, as well as the creation of a terminological database, realized using methods of computational linguistics and a corpus of documents from the mining domain (plans, reports, laws, textbooks and monographs). For systematization of data a descriptive ...
... Framework for Mining Enterprise Risk and Risk Factors from News Documents.” In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: System Demonstrations, 180–84. Osaka, Japan: The COLING 2016 Organizing Committee. https://www.aclweb.org/anthology/C16-2038. Dechow, Patricia ...
... Marti A. 1992. “Automatic Acquisition of Hyponyms from Large Text Corpora.” In Coling 1992 Volume 2: The 15th International Conference on Computational Linguistics. Henley, Ernest J, and Hiromitsu Kumamoto. 1996. “Probabilistic Risk Assessment and Management for Engineers and Scientists.” IEEE Press (2nd ...Olivera Kitanović. Ontološki model upravljanja rizikom u rudarstvu, Beograd : [O. Kitanović], 2021
-
From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)
In this paper we present the wikification of the ELTeC (European Literary Text Collection), developed within the COST Action ``Distant Reading for European Literary History'' (CA16204). ELTeC is a multilingual corpus of novels written in the time period 1840—1920, built to apply distant reading methods and tools to explore the European literary history. We present the pipeline that led to the production of the linked dataset, the novels’ metadata retrieval and named entity recognition, transformation, mapping and Wikidata population, ...Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić. "From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)" in Proceedings of The 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
-
Using English Baits to Catch Serbian Multi-Word Terminology
In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inflection... n, pages 187–197. Association for Computational Linguistics. Koehn, P., Och, F. J., and Marcu, D. (2003). Statistical Phrase-based Translation. In Proceedings of the 2003 Conference of the North American Chapter of the As- sociation for Computational Linguistics on Human Lan- guage Technology-Volume ...
... Statistical Alignment Models. Computational linguistics, 29(1):19–51. Papineni, K., Roukos, S., Ward, T., and Zhu, W.-J. (2002). BLEU: a Method for Automatic Evaluation of Machine Translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pages 311– 318. Association ...
... parallel corpora. In Pro- ceedings of the 23rd International Conference on Com- putational Linguistics: Posters, COLING ’10, pages 1256–1264, Stroudsburg, PA, USA. Association for Computational Linguistics. Vintar, Š. and Fišer, D. (2008). Harvesting multi-word ex- pressions from parallel corpora ...Cvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
-
Resource-based WordNet Augmentation and Enrichment
In this paper we present an approach to support production of synsets for SerbianWordNet(SerWN)byadjustingPrincetonWordNet(PWN)synsetsusing several bilingual English-Serbian resources. PWN synset definitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a setof1248selectedPWNsynsetsshowthattheproducedSerbiansynsetscontain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...... phrasets. In Proceedings of the Tenth Conference on European Chapter of the Association for Computational Linguistics - Volume 2, EACL ’03, pages 67–70, Stroudsburg, PA, USA. Association for Computational Linguistics. Bentivogli, L., Forner, P., Magnini, B., and Pianta, E. (2004). Revising the wordnet domains ...
... Enrichment | Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev | Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria | 2018 | | http://dr.rgf.bg.ac.rs/s/repo/item/0002014 Дигитални репозиторијум ...
... balancing. In Proceedings of the Workshop on Multilingual Linguistic Ressources, MLR ’04, pages 101–108, Stroudsburg, PA, USA. Association for Computational Linguistics. Bhingardive, S., Ajotikar, T., Kulkarni, I., Kulkarni, M., and Bhattacharyya, P. (2014). Beyond lexical units: Enriching wordnets with phrasets ...Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)
-
FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain
U radu se daje kratak prikaz teorije semantike okvira, na kojoj je zasnovana leksička baza Frejmnet. Predstavljena je koncepcija ove mreže, kao i mogućnosti njene primene. Predstavljena je i leksička analiza koja se primenjuje u projektu izrade Frejmneta i ukazano na razlike između analize zasnovane na okviru u odnosu na analizu zasnovanu na reči. Zatim je prikazano nekoliko povezanih okvira koje prizivaju reči iz domena rizika. U radu je predstavljena i platforma NLTК pomoću koje se mogu koristiti ...... semantic roles.” Computational linguistics 28 (3): 245–288. Infotheca Vol. 21, No. 1, September 2021 31 Marković A. et al., FrameNet Lexical Database. . . , pp. 7–33 Hamilton, Craig, Svenja Adolphs, and Brigitte Nerlich. 2007. “The meanings of ‘risk’: A view from corpus linguistics.” Discourse & Society ...
... s of the Corpus Linguistics Conference, 14:17. Pradhan, Sameer, Wayne Ward, Kadri Hacioglu, James H Martin, and Dan Jurafsky. 2005. “Semantic role labeling using different syntactic views.” In Proceedings of the 43rd Annual Meeting of the Association for Com- putational Linguistics (ACL’05), 581–588 ...
... Atkins, Beryl T. S. 1994. “Analyzing the verbs of seeing: a frame semantics approach to corpus lexicography.” In Annual Meeting of the Berkeley Linguistics Society, 20:42–56. 1. 30 Infotheca Vol. 21, No. 1, September 2021 Scientific paper Atkins, Sue, Charles J Fillmore, and Christopher R Johnson ...Aleksandra Marković, Ranka Stanković, Natalija Tomić, Olivera Kitanović. "FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.1
-
Multi-word Expressions for Abusive Speech Detection in Serbian
Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...... and Duško Vitas. 2020. Analysis of similes in serbian literary texts (1840- 1920) using computational methods. In Svetla Koeva, editor, Proceedings of the Fourth International Confer- ence Computational Linguistics in Bulgaria (CLIB 2020). Institute for Bulgarian Language “Prof. Lyubomir Andreychin”, ...
... Elisa Bassignana, Valerio Basile, and Viviana Patti. 2018. Hurtlex: A multilingual lexicon of words to hurt. In 5th Italian Conference on Computational Linguistics, CLiC-it 2018, volume 2253, pages 1–6. CEUR-WS. Tommaso Caselli, Valerio Basile, Jelena Mitrović, Inga Kartoziya, and Michael Granitzer ...
... detection: A hybrid approach with deep learning and a multilingual lexicon. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pages 363–370. Endang Wahyu Pamungkas, Alessandra Teresa Cignarella, Valerio Basile, and V. Patti. 2018. Automatic ...Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
-
Bilingual lexical extraction based on word alignment for improving corpus search
Jelena Andonovski, Branislava Šandrih, Olivera Kitanović. "Bilingual lexical extraction based on word alignment for improving corpus search" in The Electronic Library, Emerald (2019). https://doi.org/10.1108/EL-03-2019-0056